Is the Mean Enough to Describe a Dataset?

The mean is one of the most commonly used measures to summarize a dataset. While it provides useful information about the central tendency of the data, it doesn’t always tell the whole story. Different datasets can have the same mean yet exhibit vastly different distributions. In this example, we’ll explore two datasets with the same mean and analyze how they differ beyond that single statistic.

Example 1

Part A: Find the mean of the data set $ 76, 74, 83, 97, 70 $.
Part B: Find the mean of the data set $ 82, 81, 76, 85, 76 $.
Part C: Even though the means are the same, how are the data sets different?

Solution

Part A: The mean is the sum of all data values divided by the number of values in the set. The given data set is $ 76, 74, 83, 97, 70 $. \[ \begin{aligned} \text{Mean} &= \dfrac{76 + 74 + 83 + 97 + 70}{5} \\ &= \dfrac{400}{5} \\ &= 80 \end{aligned} \]
Part B: The given data set is $ 82, 81, 76, 85, 76 $. We follow the same process. \[ \begin{aligned} \text{Mean} &= \dfrac{82 + 81 + 76 + 85 + 76}{5} \\ &= \dfrac{400}{5} \\ &= 80 \end{aligned} \]
Part C:
Even though both data sets have the same mean, they differ in how their values are spread. The first data set runs from 70 to 97, a range of $ 97 - 70 = 27 $, while the second runs from 76 to 85, a range of only $ 85 - 76 = 9 $. The values in the first set are spread widely around the mean of 80, whereas the values in the second set cluster closely around it.
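
The comparison can also be checked with a few lines of Python. This is only a minimal sketch: the names `data_a`, `data_b`, `mean`, and `value_range` are illustrative choices, not part of the original example.

```python
# Data sets from Example 1 (variable names are illustrative).
data_a = [76, 74, 83, 97, 70]
data_b = [82, 81, 76, 85, 76]


def mean(values):
    """Sum of the values divided by the number of values."""
    return sum(values) / len(values)


def value_range(values):
    """Largest value minus smallest value."""
    return max(values) - min(values)


for name, data in [("A", data_a), ("B", data_b)]:
    print(f"Set {name}: mean = {mean(data)}, range = {value_range(data)}")

# Set A: mean = 80.0, range = 27
# Set B: mean = 80.0, range = 9
```

Both sets report the same mean, but the range of the first is three times that of the second.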

$$\tag*{$\blacksquare$}$$

Conclusion

The mean alone is not always sufficient to describe a dataset. It measures central tendency, but other statistics such as the range, variance, or standard deviation are needed to understand how the data vary around that center. As this example shows, two datasets with identical means can have very different spreads.
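
As a rough illustration of the other statistics mentioned here, the sketch below computes the variance and standard deviation of the two data sets from Example 1 using Python's standard `statistics` module. It assumes each set is treated as a complete population (hence `pvariance` and `pstdev`); if the values were a sample from a larger group, `variance` and `stdev` would be the usual choice and would give somewhat larger numbers. The variable names are illustrative.

```python
import statistics

# Data sets from Example 1 (variable names are illustrative).
data_a = [76, 74, 83, 97, 70]
data_b = [82, 81, 76, 85, 76]

# Assumption: each set is a whole population, so we divide by n, not n - 1.
var_a = statistics.pvariance(data_a)
var_b = statistics.pvariance(data_b)
std_a = statistics.pstdev(data_a)
std_b = statistics.pstdev(data_b)

print(f"Set A: variance = {var_a:.2f}, standard deviation = {std_a:.2f}")
print(f"Set B: variance = {var_b:.2f}, standard deviation = {std_b:.2f}")

# Set A: variance = 90.00, standard deviation = 9.49
# Set B: variance = 12.40, standard deviation = 3.52
```

Under that assumption, the first set's standard deviation is more than twice the second's, even though both means equal 80.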